Methods for designing integrated circuits employing voltage scaling and integrated circuits designed thereby

ABSTRACT

Various embodiments of methods of designing an integrated circuit (IC). One embodiment of one such method includes: (1) generating a functional IC design, (2) determining a target clock rate for the functional IC design, (3) synthesizing a netlist from the functional IC design that meets the target clock rate, (4) determining a performance/power ratio from the netlist, (5) attempting to increase the performance/power ratio by changing at least one of a speed, an area and a power consumption in at least some noncritical paths in the netlist, and (6) implementing a layout of the IC from the netlist.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional of application Ser. No. 12/364,918, filed by Parker, et al., filed on Feb. 3, 2009, entitled “Methods for Designing Integrated Circuits Employing Voltage Scaling and Integrated Circuits Designed Thereby,” currently allowed for issuance and commonly assigned with this application and incorporated herein by reference. This application also claims the benefit of U.S. Provisional Application Ser. No. 61/126,881, filed by Parker, et al., filed on May 7, 2008, entitled “LSIL AVSO: A Novel Paradigm for Optimizing Performance, Power, Area, and/or Yield in Integrated Circuits,” also commonly assigned with this disclosure and incorporated herein by reference. This application is related to U.S. Pat. No. 8,024,694 to Jamann, et al., filed on Feb. 3, 2009, entitled “Systematic Benchmarking System and Method for Standardized Data Creation, Analysis and Comparison of Semiconductor Technology Node Characteristics,” commonly assigned with the invention and incorporated herein by reference and U.S. Pat. No. 8,281,266 to Jamann, et al., filed on Feb. 3, 2009 entitled “Systematic, Normalized Metric for Analyzing and Comparing Optimization Techniques for Integrated Circuits Employing Voltage Scaling and Integrated Circuits Designed Thereby,” commonly assigned with the invention and incorporated herein by reference. U.S. Pat. No. 8,024,694 has corresponding divisional applications, U.S. Pat. Nos. 8,307,324 and 8,539,423. U.S. Pat. No. 8,281,266 has a corresponding divisional application, U.S. patent application Ser. No. 13/599,549, currently pending.

TECHNICAL FIELD

The invention is directed, in general, to integrated circuits (ICs) and, more specifically, to methods for designing ICs employing voltage scaling and ICs designed using the methods.

BACKGROUND

Conserving resources, including energy, has become a pre-eminent objective in today's world. Manufacturers of ICs are sensitive to the need to improve the energy efficiency of their products. Those skilled in the pertinent art are aware that various measures may be taken in an electronic circuit to reduce its power consumption. One measure is to use cells (i.e., logic elements including devices, e.g., transistors) that leak less current when turned off. Another measure is to use a lower voltage to drive the cells. Unfortunately, using lower leakage current cells or lower drive voltages almost always reduces the speed at which signals propagate through the circuit. Consequently, the circuit may not operate as fast as needed or desired.

Area and yield are also important considerations in circuit design. IC fabrication cost generally decreases as IC substrate (“die”) size decreases. Increasing yield means decreasing scrap, which by definition reduces overall IC fabrication cost.

Circuit designers use electronic design automation (EDA) tools, a category of computer aided design (CAD) tools, to create a functional circuit design, including a register transfer logic (RTL representation) representation of the functional circuit design, synthesize a “netlist” from the RTL representation, and implement a layout from the netlists. Synthesis of the netlist and implementation of the layout involve simulating the operation of the circuit and determining where cells should be placed and where the interconnects that couple the cells together should be routed. EDA tools allow designers to construct a circuit, simulate its performance, estimate its power consumption and area and predict its yield using a computer and without requiring the costly and lengthy process of fabrication. EDA tools are indispensable for designing modern ICs, particularly very-large-scale integrated circuits (VSLICs). For this reason, EDA tools are in wide use.

One such EDA tool performs timing signoff. Timing signoff is one of the last steps in the IC design process and ensures that signal propagation speed in a newly-designed circuit is such that the circuit will operate as intended. Signals that propagate too slowly through the circuit cause setup violations; signals that propagate too quickly through the circuit cause hold violations. Setup or hold violations frustrate the logic of the circuit and prevent it from performing the job it was designed to do.

Timing signoff is performed with highly accurate models of the circuit under multiple sets of assumptions regarding expected variations, called “corners.” Process-voltage-temperature (PVT) corners are based on assumptions regarding variations in device operation from one IC to another, drive voltage and operating temperature. Resistance-capacitance (R, C, or RC) corners are based on assumptions regarding variations in one or both of interconnect resistance and capacitance from one IC to another. Conventional timing signoff identifies setup and hold violations in a “slow” PVT corner (in which process variations are assumed to yield relatively slow-switching devices, and drive voltage and operating temperature are such that device switching speeds are their slowest) and a “worst” RC corner (in which process variations are assumed to yield interconnects having relatively high resistance and capacitance). Conventional timing signoff also identifies hold violations in a “fast” PVT corner (in which process variations are assumed to yield relatively fast-switching devices, and drive voltage and operating temperature are such that device switching speeds are their fastest) and a “best” RC corner (in which process variations are assumed to yield interconnects having relatively low resistance and capacitance). Conventional signoff timing also takes on-chip variations (OCV), which are process variations occurring over the area of a given IC, into account using statistical methods.

Thus a fundamental tradeoff exists among speed and power consumption. Further considerations involve speed, power consumption, area and yield. These force the circuit designer to employ EDA tools, particularly timing signoff, to strike a delicate balance. Tempering the designer's zeal are the above-described process and environmental variations to which every production circuit is subject. These variations increase the degree to which the designer must ensure that production circuits work under real-world operating conditions and therefore the complexity of timing signoff.

SUMMARY

Various embodiments of methods of designing an IC. One embodiment of one such method includes: (1) generating a functional IC design, (2) determining a target clock rate for the functional IC design, (3) synthesizing a netlist from the functional IC design that meets the target clock rate, (4) determining a performance/power ratio from the netlist, (5) attempting to increase the performance/power ratio by changing at least one of a speed, an area and a power consumption in at least some noncritical paths in the netlist, and (6) implementing a layout of the IC from the netlist.

One embodiment of another such method includes: (1) generating a functional IC design, (2) determining a target clock rate for the functional IC design, (3) determining a target area for the functional IC design, (4) determining a target power consumption for the functional IC design, (5) determining whether the IC is to employ voltage scaling or adaptive voltage scaling, (6) synthesizing a netlist from the functional IC design that meets the target clock rate, (7) determining a performance/power ratio from the netlist, (8) attempting to increase the performance/power ratio by changing all of the speed, the area and the power consumption in the at least some noncritical paths in the netlist, and (9) implementing a layout of the IC from the netlist.

Another aspect of the invention provides an IC. One embodiment of the IC includes: (1) functional circuitry located in at least one drive voltage domain, (2) at least one PVT monitor and at least one thermal monitor located in the at least one domain, (3) a voltage management unit configured to receive output signals from the at least one PVT monitor and the at least one thermal monitor and determine at least one drive voltage for the at least one domain based thereon, and (4) a regulator coupled to the voltage management unit and configured to provide the at least one drive voltage.

The foregoing has outlined certain aspects and embodiments of the invention so that those skilled in the pertinent art may better understand the detailed description of the invention that follows. Additional aspects and embodiments will be described hereinafter that form the subject of the claims of the invention. Those skilled in the pertinent art should appreciate that they can readily use the disclosed aspects and embodiments as a basis for designing or modifying other structures for carrying out the same purposes of the invention. Those skilled in the pertinent art should also realize that such equivalent constructions do not depart from the scope of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the invention, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:

FIG. 1 is a graph of PVT corners showing, in particular, traditional PVT corners with respect to an IC that does not employ voltage scaling;

FIG. 2 is a graph of device drive voltage and device speed showing, in particular, PVT corners relevant to voltage scaling;

FIG. 3 is a graph of device drive voltage and device speed showing, in particular, performance failure, hold/hazard failure and safe operating zones for an IC employing voltage scaling;

FIG. 4 is a block diagram of various embodiments of an IC employing an AVS architecture;

FIG. 5 is a flow diagram of one embodiment of a method of designing an IC employing voltage scaling; and

FIG. 6 is a flow diagram of another embodiment of a method of designing an IC employing voltage scaling.

DETAILED DESCRIPTION OF CERTAIN ASPECTS AND EMBODIMENTS

Voltage scaling, referred to in the Summary above, is a technique whereby the drive voltage to a particular IC is modulated to one or more particular values such that the IC can function properly. Voltage scaling is particularly suited to compensate for process variations. Static voltage scaling may be performed at the factory (e.g., during calibration) or before the IC begins normal operation (e.g., during powerup initialization). In contrast, adaptive voltage scaling (AVS) is performed continually while the IC is in normal operation and particularly effective at compensating for temperature variations and device aging as well as process variations. ICs can have one or more domains, each having its own voltage regulator. Drive voltage can therefore be modulated separately in each domain, allowing compensation for OCV to be carried out as well.

While voltage scaling (including AVS) is known, it has heretofore been used only to compensate for process and temperature variations and aging in an IC that has been designed by a conventional method. What has not been realized until now, however, is that voltage scaling has the potential to change the fundamental theory under which an IC operates, and that, accordingly, the method by which an IC is designed may be transformed to take full advantage of the benefits of voltage scaling. Consequently, introduced herein are novel methods of designing ICs such that their performance, area, power consumption, yield or any combination of these may be realized beyond the limits of current design methods.

Also introduced herein is an AVS architecture that is capable of adapting drive voltage independent of “system software,” defined as external, user or application software (including firmware) that executes in the IC. System software is typically loaded into an IC following its delivery to a customer. In other words, system software does not need to take into account, or be involved in, the adaptation of drive voltage. Consequently, such software does not have to be modified to execute in an IC that employs the novel AVS architecture. Those skilled in the software art often call this trait “orthogonality” or “transparency;” the novel AVS architecture is orthogonal or transparent to the application software.

FIG. 1 is a graph of PVT corners showing, in particular, traditional PVT corners with respect to an IC not employing voltage scaling. FIG. 1 shows a “slow” PVT corner 110 in which process variations are assumed to yield relatively slow-switching devices, and drive voltage and operating temperature are such that device switching speeds are their slowest. FIG. 1 also shows a “fast” PVT corner 120 in which process variations are assumed to yield relatively fast-switching devices, and drive voltage and operating temperature are such that device switching speeds are their highest. The slow and fast PVT corners 110, 120 represent extremes. Setup violations result from signals propagating too slowly and arriving too late for subsequent use and are most likely to occur at the PVT slow corner 110. Hold violations result from signals propagating too quickly and arriving too soon to be sustained for subsequent use and are most likely to occur at the fast PVT corner 120. Conventional timing analysis is performed at the slow and fast PVT corners 110, 120, since they represent the greatest challenge to IC operation. In fact, it has been found that most timing issues occur in a region 130 proximate and perhaps including the slow PVT corner 110.

It has also been determined that ICs operating with voltage scaling need not be subjected to timing analysis at the slow and fast PVT corners 110, 120. Voltage scaling renders the slow and fast PVT corners 110, 120 irrelevant. An IC employing voltage scaling (and particularly AVS) does not operate in these corners. Instead, as will be shown, other PVT corners bound the operation of an IC employing voltage scaling. As stated above, it has been realized that the method by which an IC is designed may be modified to take advantage of this fact. Instead of selecting circuit configurations (e.g., architectures and datapath widths) and devices and closing timing at the slow and fast PVT corners 110, 120 (and conquering the region 130), the IC design process can instead focus on more fundamental design objectives: power, performance, area, yield or any combination of these.

FIG. 2 is a graph of device drive voltage and device speed showing, in particular, PVT corners relevant to voltage scaling. FIG. 2 shows a first PVT corner 210 in which process variations are assumed to yield relatively slow-switching devices and temperatures are such that device switching speeds are at their slowest. However, under such conditions, voltage scaling compensates for this inadequate speed by setting drive voltage at its highest level, increasing speed to an acceptable level. FIG. 2 also shows a second PVT corner 220 in which process variations are assumed to yield relatively fast-switching devices and temperatures are such that device switching speeds are at their highest. However, under such conditions, voltage scaling compensates for this excessive speed by setting drive voltage at its lowest level, decreasing speed to an acceptable level. Given OCV and temperature variations over the area of an IC, a region 230 results. It has been determined that IC design efforts are best spent on optimizing performance in the region 230. In one embodiment, IC design efforts are focused exclusively in the region 230.

The method introduced herein can also be applied in a reduced risk manner by creating extended safe-zones of operation. In addition, AVS can be employed to introduce over-drive (e.g., to about 110% V_(DD)) and under-drive (e.g., to about 90% V_(DD)). Of course, other ranges of over- and under-drive may be employed in alternative embodiments. Furthermore, over-and under-drive need not be the same. AVS gives the IC designer the ability to choose a desired optimization target in a safe-zone as will now be shown.

FIG. 3 is a graph of device drive voltage and device speed showing, in particular, performance failure, hold/hazard failure and safe operating zones for an IC employing voltage scaling. FIG. 3 shows the first and second PVT corners 210, 220 of FIG. 2. FIG. 3 also shows third and fourth PVT corners 300, 310. The third PVT corner 300 represents a nominal drive voltage V_(DD) applied to an IC in which process variations are assumed to yield relatively slow-switching devices and temperatures are such that device switching speeds are at their slowest. The fourth PVT corner 310 represents a nominal drive voltage V_(DD) applied to an IC in which process variations are assumed to yield relatively fast-switching devices and temperatures are such that device switching speeds are at their highest. A span 320 represents a range of over-drive, and a span 330 represents a range of under-drive, “over-drive” and “under-drive” being defined with reference to the nominal drive voltage V_(DD). Thus, the first, second, third and fourth PVT corners 210, 220, 300, 310 define a safe zone 340 of operation for an IC within which AVS is capable of scaling drive voltage to maintain proper IC operation.

A performance failure zone 350 lies below the safe zone 340 and encompasses operating conditions in which setup failures would occur. A hold/hazard failure zone 360 lies above the safe zone 340 and encompasses operating conditions in which hold failures would occur. An operating line 370 representing the points of actual operation of a particular IC lies within the safe zone 340 and is, as expected, bounded on its ends by the first and second PVT corners 210, 220. The operating line 370 divides the safe zone 340 into a lower, performance margin zone 380 and an upper, hold/hazard margin zone 390. The lower, performance margin zone 380 represents a margin by which the operating line 370 is separated from the performance failure zone 350. This margin comes at the cost of performance: performance is lower, and power and area are higher, than optimal. The upper, hold/hazard margin zone 390 represents a margin by which the operating line 370 is separated from the hold/hazard failure zone 360. This margin comes at the cost of additional buffering needed to hold signals pending subsequent use.

The graph of FIG. 3 reveals several aspects of optimization that may be exploited. First, as an upwardly pointing arrow to the right of the graph of FIG. 3 indicates, the performance of the IC may be increased, or the area of the IC may be decreased, by reducing the performance margin zone 380. Second, as a downwardly pointing arrow to the right of the graph of FIG. 3 indicates, the power consumed by the IC may be decreased by reducing the hold/hazard margin zone 390. Consequently, optimizing the design of an IC in terms of performance, power and area amounts to minimizing the width (indicated by a line 395) of the safe zone 340. As a result, power consumption may be reduced (fast devices can operate at a lower drive voltage), smaller cells (of less area) may be used, further reducing power consumption to meet the same performance (slow devices can operate at a higher drive voltage), and the performance of the IC can be increased by avoiding the slow PVT corner 110 of FIG. 1, allowing the IC to be run faster than previously (slow devices can operate at a higher drive voltage). Furthermore, IC design and test time (“turn-around-time”) can be decreased due to reduced CAD tool run-times and ease in achieving existing performance requirements. Process variations may also be reduced, and yields increased, in IC designs implemented at non-worst-case PVT corners.

One metric that may be employed to evaluate the degree to which a particular IC design has been optimized is a performance/power ratio. An early benchmark indicates that one embodiment of the method described herein applied to an IC designed with 40 nm technology can realize about a 130% increase in the performance/power ratio over a conventional design method. Another embodiment of the method was applied to a test-case to understand the benefits realized in the 65 nm-LP and 65 nm-G+ technologies.

TABLE 1 Benefits Realized in 65 nm-LP and 65 nm-G+ Technologies Power Performance Reduction Area Reduction Gain Beyond Beyond Beyond Traditional Traditional Traditional Technology Worst-Case Worst-Case Worst-Case 65 nm-LP Not optimized +17% +18% 65 nm-LP +50% −30% −13% 65 nm-G+ Not optimized. +42% +16% 65 nm-G+ +27% −5% −6%

Table 1, above, indicates a completely new optimization space beyond that offered by conventional IC design methods.

As stated above, one aspect of the invention is directed to an architecture that allows AVS to be carried out in such a way that it is transparent or orthogonal to system or application software. One embodiment of the architecture is implemented as a fully integrated, self-governing, closed-loop system. Another embodiment of the architecture is implemented in minimalist form as an open-loop solution. Several variations are possible between these two extreme implementations by selecting components of the architecture. In conjunction, the degree of system software control can be tuned from zero-intervention to full-control, and variations in between. While the flexibility enables a range of solutions, these solutions should be managed with reference to the components of the architecture employed in a particular IC design.

FIG. 4 is a block diagram of various embodiments of an IC 400 employing an AVS architecture. As described above, an AVS architecture is introduced that is transparent or orthogonal to external software. Such software can gain access to certain embodiments of the architecture, allowing the software to receive AVS data or control AVS. The IC 400 has N domains, N being any integer number.

The IC 400 has a voltage management unit (VMU) 410. The VMU 410 is configured to set one or more drive voltages V_(DD) within a range V_(min) to V_(max) based on signals received from one or more monitors in each of the N domains. The VMU 410 typically sets the drive voltage V_(DD) in each domain at the lowest level necessary (subject to a small safety margin) to guarantee that signal propagation speeds are adequate to avoid setup errors. This allows the IC 400 to operate at a reduced (e.g., minimal) power consumption level. The VMU 410 includes voltage management (VM) logic 411 configured to execute algorithms for interpreting signals from the monitors and scaling voltage in response thereto. A PVT monitor interface 412 is configured to provide an interface for one or more PVT monitors (to be described below). A thermal monitor interface 413 is configured to provide an interface for one or more thermal monitors (to be described below). A host processor interface 414 is configured to provide an interface for a host processor (to be described below). A Joint Test Experts Group (JTAG) interface 415 is configured to provide an interface for a JTAG port (to be described below). A voltage regulator interface 416 is configured to provide an interface for one or more voltage regulators (to be described below). A voltage monitor interface 417 is configured to provide an interface for a voltage monitor (to be described below).

FIG. 4 explicitly illustrates three circuit blocks: a circuit block 1 420-1, a circuit block 2 420-2 and a circuit block N 420-N. Each circuit block has functional circuitry associated with it, namely functional circuits 421-1, 421-2, . . . , 421-N. The functional circuits 421-1, 421-2, . . . , 421-N carry out the useful functions that the IC 400 has been designed to perform. For example, if the IC 400 is a microprocessor, the functional circuits 421-1, 421-2, . . . , 421-N may include an address decoder, an arithmetic and logic unit, a floating point unit, a register file and read-only memory for storing microcode. If the IC 400 is a mixed-signal application-specific IC (ASIC), the functional circuits 421-1, 421-2, . . . , 421-N may include analog-to-digital converters (ADCs), digital-to-analog converters (DACs), and various circuits for manipulating analog and digital signals. The functional circuits 421-1, 421-2, . . . , 421-N of the various circuit blocks 420-1, 420-2, . . . , 420-N may differ in size, function, configuration or in any other respect.

Each circuit block 420-1, 420-2, . . . , 420-N also has various monitors associated therewith. PVT monitors 422-1, 422-2, . . . , 422-N may respectively associated with the circuit blocks 420-1, 420-2, . . . , 420-N and coupled to the PVT monitor interface 412. The PVT monitors 422-1, 422-2, . . . , 422-N are configured to produce signals that are based on their respective device speeds and may take the form of ring oscillators.

Thermal monitors 423-1, 423-2, . . . , 423-N may also respectively associated with the circuit blocks 420-1, 420-2, . . . , 420-N and coupled to the thermal monitor interface 413. The thermal monitors 423-1, 423-2, . . . , 423-N are configured to produce signals that are based on their respective temperatures.

Critical path detectors (CPDs) 424-1, 424-2, . . . , 424-N (sometimes called critical path monitors, or CPMs) may also respectively associated with the circuit blocks 420-1, 420-2, . . . , 420-N and are coupled to a host processor 430. The host processor 430 is in turn coupled to the host processor interface 414 of the VMU 410. The CPDs 424-1, 424-2, . . . , 424-N are configured to produce signals that are based on the actual arrival times of signals in critical paths relative to when they are supposed to arrive. In one specific embodiment, the CPDs 424-1, 424-2, . . . , 424-N provide a binary response if a fixed margin from the critical path is transgressed. The outputs of the CPDs 424-1, 424-2, . . . , 424-N are OR-ed and brought as a single signal to the VMU 410 so that the VMU 410 can take evasive action in the event a margin transgression occurs. Various embodiments of CPDs are set forth in U.S. patent application Ser. No. 12/247,992, filed by Chakravarty on Oct. 8, 2008, entitled “Critical Path Monitor for an Integrated Circuit and Method of Operation Thereof,” commonly assigned herewith and incorporated herein by reference.

In the embodiment of FIG. 4, the CPDs 424-1, 424-2, . . . , 424-N provide a binary response if a fixed margin from the critical path is transgressed. The outputs of the CPDs 424-1, 424-2, . . . , 424-N are OR-ed and brought as a single signal to the VMU 410 so that the VMU 410 can take evasive action in the event a margin transgression occurs. Note that this is optional; the CPD input of the VMU 410 can be hardwired to suppress this feature if it is not required in a specific embodiment.

As the functional circuits 421-1, 421-2, . . . , 421-N operate, the PVT monitors 422-1, 422-2, . . . , 422-N, thermal monitors 423-1, 423-2, . . . , 423-N and the CPDs 424-1, 424-2, 424-N generate signals indicating device speeds, temperatures and critical path speeds in their respective circuit blocks 420-1, 420-2, . . . , 420-N. As stated above, the VM logic 411 receives these signals and makes decisions regarding the scaling of drive voltages to be supplied to the circuit blocks 420-1, 420-2, . . . , 420-N.

A JTAG port 440 is coupled to the JTAG interface 415. The JTAG port 440 allows the VMU 410 to be tested. Those skilled in the pertinent art are familiar with the use of JTAG or other boundary scan testing techniques that may be employed to test circuitry.

A voltage regulator unit 450 is coupled to the voltage regulator interface 416 of the VMU 410. The voltage regulator unit 410 is configured to receive voltage control signals from the VMU 410 and scale drive voltages supplied to the circuit blocks 420-1, 420-2, . . . , 420-N. FIG. 4 shows four different alternative embodiments of the voltage regulator unit 450. In a first embodiment, the voltage regulator unit 450 includes a fully integrated (on-chip) regulator 451 configured to receive one or more voltage regulator signals and generate one or more corresponding drive voltages. In a second embodiment, the voltage regulator unit 450 includes an integrated (on-chip) power controller 452 configured to receive one or more voltage regulator signals and drive one or more external (off-chip) power transistors 461 to generate one or more corresponding drive voltages. In a third embodiment, the voltage regulator unit 450 includes a generic commercial regulator interface 453 configured to receive one or more voltage regulator signals and provide them to one or more external (off-chip) commercial regulators 462 to generate one or more corresponding drive voltages. In a fourth embodiment, the voltage regulator unit 450 includes a system-specific regulator interface 454 configured to receive one or more voltage regulator signals and drive one or more specific off-chip system regulators 463 to generate one or more corresponding drive voltages.

Irrespective of the embodiment, a voltage monitor 470 monitors the one or more drive voltages and provides a feedback signal through the voltage monitor interface 417 of the VMU 410. References 480 provide voltage or current references as needed to form a basis for comparison for, among other things, the PVT monitors 422-1, 422-2, . . . , 422-N, thermal monitors 423-1, 423-2, . . . , 423-N and the CPDs 424-1, 424-2, . . . , 424-N and the voltage monitor 470.

While the illustrated embodiment of the AVS architecture is capable of operating transparently or orthogonally with respect to system software executing in the IC 400, it also allows some software intervention. For example, software may monitor and gather data from internal registers of the VMU 410. System software may be allowed override the VMU 410 and take direct control over voltage scaling. This may be employed, for example, to implement frequency-dependent voltage scaling. System software may also be able to initiate or perform diagnostics with respect to the AVS architecture.

FIG. 5 is a flow diagram of one embodiment of a method of designing an IC employing voltage scaling. The method begins in a start step 505. In a step 510, performance objectives are determined. The performance objectives may be expressed in terms of a target data throughput, a target clock frequency, a target die size, a target overall power consumption, a target yield percentage or any other conventional or later-determined performance objective. In a step 515, an optimization target voltage is determined. For example, a particular IC design may have an optimization target voltage of 1.7V. However, all optimization target voltages are within the scope of the invention. In a step 520, a decision is made as to whether voltage scaling is needed. The performance objectives defined above may be such that additional voltage scaling (or AVS) circuitry may not be needed. If voltage scaling is not needed, a conventional IC design method may then be employed.

However, the flow diagram of FIG. 5 assumes that voltage scaling is needed. In a step 525, a decision is made as to whether the voltage scaling is to be static (non-AVS) or adaptive (AVS). The result of the decision of the step 525 determines the PVT corners and libraries that are to be used in synthesizing a netlist. In a step 530, a functional IC design and a register transfer logic (RTL representation) representation of that IC design are generated. Those skilled in the pertinent art understand how to generate a functional IC design and an RTL representation based thereon.

In a step 535, the RTL representation is synthesized to yield a netlist using the optimization target voltage. Those skilled in the pertinent art are familiar with the construction and content of libraries of IC devices in general and are aware that such libraries contain standard implementations, along with physical attributes, of devices that can be implemented in an IC. Some attributes are largely independent of fabrication process variation, including the numbers and locations of device terminals, the shape and size of the device footprint and the numbers and types of process steps that should be undertaken to fabricate the device and process-dependent attributes. Other attributes vary, such as the switching speed of the device (if it is a transistor), the drive voltage of the device, the current-handling capability of the device and the power consumption of the device. As described above, the process-dependent attributes of the library are determined with reference to PVT corners; the PVT corners of a library employed to design an IC that implements voltage scaling are different from those of a conventional library. In general, since voltage scaling renders conventional, more extreme, PVT corners irrelevant, design margins can be relaxed, and greater flexibility exists with respect to the selection of devices to be used in an IC.

During the synthesis of the RTL representation into the netlist, fundamental decisions may be made regarding the architecture of the IC, including its logic circuits. Those skilled in the pertinent art understand that logic circuits may be optimized in different ways. An IC may need to perform a multiply function. However, that multiply function may be implemented with different multiplier architectures. Multipliers having wide datapaths (parallel units) may be faster but consume more power and area than multipliers having narrower datapaths (e.g., a single unit with intermediate result feedback). The RTL representation may be generated with reference to a library containing more than one architecture for various logic circuits, and choices among those architectures may be made based on the greater latitude afforded by static voltage scaling or AVS.

In a step 540, devices are placed, a clock tree is synthesized, and routing is determined according to the RTL representation and at the optimization target voltage. In a step 545, a timing signoff is performed at the optimization target voltage. Those skilled in the pertinent art understand how to perform timing signoff at a given drive voltage. The method ends in an end step 550.

FIG. 6 is a flow diagram of another embodiment of a method of designing an IC employing voltage scaling. The method begins in a start step 605. In a step 610, performance objectives are determined. As above, the performance objectives may be expressed in terms of a target data throughput, a target clock frequency, a target die size, a target overall power consumption, a target yield percentage or any other conventional or later-determined performance objective. In a step 615, an optimization target voltage is determined. For example, a particular IC design may have an optimization target voltage of 1.7V. However, all optimization target voltages are within the scope of the invention. In a step 620, a decision is made as to whether voltage scaling is needed. The performance objectives defined above may be such that additional voltage scaling (or AVS) circuitry may not be needed. If voltage scaling is not needed, a conventional IC design method may then be employed.

However, the flow diagram of FIG. 6 assumes that voltage scaling is needed. In a step 625, a decision is made as to whether the voltage scaling is to be static (non-AVS) or adaptive (AVS). The result of the decision of the step 630 determines the PVT corners and libraries that are to be used in generating a functional IC design and a corresponding RTL representation. In a step 630, a functional IC design and corresponding RTL representation are generated. In a step 635, a candidate netlist is synthesized from the RTL representation. In a step 640, critical paths in the candidate netlist are identified. In a step 645, the speeds of critical paths are increased. In a more specific embodiment, the speed of each critical path is increased until the path is only barely critical. In a step 650, the areas of noncritical paths are decreased. In a step 655, the power consumption of noncritical paths is decreased. In a step 660, devices are placed, a clock tree is synthesized, and routing is determined according to the RTL representation and at the optimization target voltage. In a step 665, a timing signoff is performed at the optimization target voltage. The method ends in an end step 670.

Those skilled in the art to which the invention relates will appreciate that other and further additions, deletions, substitutions and modifications may be made to the described embodiments without departing from the scope of the invention. 

What is claimed is:
 1. A method of designing an integrated circuit, comprising: generating a functional integrated circuit design; determining a target clock rate for said functional integrated circuit design; synthesizing a netlist from said functional integrated circuit design that meets said target clock rate; determining a performance/power ratio from said netlist; attempting to increase said performance/power ratio by changing at least one of a speed, an area and a power consumption in at least some noncritical paths in said netlist; and implementing a layout of said integrated circuit from said netlist.
 2. The method as recited in claim 1 further comprising determining a target area for said functional integrated circuit design.
 3. The method as recited in claim 1 further comprising determining a target power consumption for said functional integrated circuit design.
 4. The method as recited in claim 1 further comprising determining whether said integrated circuit is to employ voltage scaling or adaptive voltage scaling.
 5. The method as recited in claim 1 wherein said attempting comprises attempting to increase said performance/power ratio by changing all of said speed, said area and said power consumption in said at least some noncritical paths in said netlist.
 6. The method as recited in claim 1 wherein said attempting comprises attempting to increase said performance/power ratio by changing said at least one of said speed, said area and said power consumption in all of said noncritical paths in said netlist.
 7. The method as recited in claim 1 wherein said attempting is carried out only with respect to true noncritical paths in said netlist.
 8. An integrated circuit designed by the method as recited in claim
 1. 9. A method of designing an integrated circuit, comprising: generating a functional integrated circuit design; determining a target clock rate for said functional integrated circuit design; determining a target area for said functional integrated circuit design; determining a target power consumption for said functional integrated circuit design; determining whether said integrated circuit is to employ voltage scaling or adaptive voltage scaling; synthesizing a netlist from said functional integrated circuit design that meets said target clock rate; determining a performance/power ratio from said netlist; attempting to increase said performance/power ratio by changing all of said speed, said area and said power consumption in said at least some noncritical paths in said netlist; and implementing a layout of said integrated circuit from said netlist.
 10. The method as recited in claim 9 wherein said attempting comprises attempting to increase said performance/power ratio by changing said at least one of said speed, said area and said power consumption in all of said noncritical paths in said netlist.
 11. The method as recited in claim 9 wherein said attempting is carried out only with respect to true noncritical paths in said netlist.
 12. An integrated circuit designed by the method as recited in claim
 9. 13. An integrated circuit, comprising: functional circuitry located in at least one drive voltage domain; at least one PVT monitor and at least one thermal monitor located in said at least one domain; a voltage management unit configured to receive output signals from said at least one PVT monitor and said at least one thermal monitor and determine at least one drive voltage for said at least one domain based thereon; and a regulator coupled to said voltage management unit and configured to provide said at least one drive voltage.
 14. The integrated circuit as recited in claim 13 further comprising at least one critical path detector located in said at least one domain, said voltage management unit further configured to receive output signals from said at least one critical path monitor and determine said at least one drive voltage based thereon.
 15. The integrated circuit as recited in claim 13 wherein said regulator includes an integrated regulator.
 16. The integrated circuit as recited in claim 13 wherein said regulator includes an integrated power controller.
 17. The integrated circuit as recited in claim 13 wherein said regulator includes a commercial regulator interface.
 18. The integrated circuit as recited in claim 13 wherein said regulator includes a system-specific interface.
 19. The integrated circuit as recited in claim 13 further comprising a host processor coupled to said voltage management unit. 